MULTISPEECH - 2016 - Annual activity report

MULTISPEECH

MULTISPEECH - 2016

Project-Team Multispeech

Members

Overall Objectives

Research Program

Application Domains

Highlights of the Year

New Software and Platforms

New Results

Bilateral Contracts and Grants with Industry

Bilateral Contracts with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: New Software and Platforms

KATS

Kaldi-based Automatic Transcription System

Keyword: Speech recognition

Functional Description

KATS is a multipass system for transcribing audio data, and in particular radio or TV shows. The audio stream is first split into homogeneous segments that are decoded using the most adequate acoustic model with a large vocabulary continuous speech recognition engine. In this new software, the recognition engine is based on the Kaldi toolkit, and uses Deep Neural Network - DNN - based acoustic models. An extra processing pass is run in order to rescore the $n$ -best hypotheses with a higher order language model.

Participants: Odile Mella, Dominique Fohr and Denis Jouvet
Contact: Dominique Fohr
URL: Available online on the A||go platform: https://allgo.inria.fr/app/loriasts_kaldi

Previous |

Home | Next next